NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Learning Fractals by Gradient Descent

https://doi.org/10.1609/aaai.v37i2.25342

Tu, Cheng-Hao; Chen, Hong-You; Carlyn, David; Chao, Wei-Lun (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

Fractals are geometric shapes that can display complex and self-similar patterns found in nature (e.g., clouds and plants). Recent works in visual recognition have leveraged this property to create random fractal images for model pre-training. In this paper, we study the inverse problem --- given a target image (not necessarily a fractal), we aim to generate a fractal image that looks like it. We propose a novel approach that learns the parameters underlying a fractal image via gradient descent. We show that our approach can find fractal parameters of high visual quality and be compatible with different loss functions, opening up several potentials, e.g., learning fractals for downstream tasks, scientific understanding, etc.
more » « less
Full Text Available
A SIMPLE INTERPRETABLE TRANSFORMER FOR FINEGRAINED IMAGE CLASSIFICATION AND ANALYSIS

Paul, Dipanjyoti; Chowdhury, Arpita; Xiong, Xinqi; Chang, Feng-Ju; Carlyn, David; Stevens, Samuel; Provost, Kaiya; Karpatne, Anuj; Carstens, Bryan; Rubenstein, Daniel; et al (May 2024, ICLR)

We present a novel usage of Transformers to make image classification interpretable. Unlike mainstream classifiers that wait until the last fully connected layer to incorporate class information to make predictions, we investigate a proactive approach, asking each class to search for itself in an image. We realize this idea via a Transformer encoder-decoder inspired by DEtection TRansformer (DETR). We learn “class-specific” queries (one for each class) as input to the decoder, enabling each class to localize its patterns in an image via cross-attention. We name our approach INterpretable TRansformer (INTR), which is fairly easy to implement and exhibits several compelling properties. We show that INTR intrinsically encourages each class to attend distinctively; the cross-attention weights thus provide a faithful interpretation of the prediction. Interestingly, via “multi-head” cross-attention, INTR could identify different “attributes” of a class, making it particularly suitable for fine-grained classification and analysis, which we demonstrate on eight datasets. Our code and pre-trained models are publicly accessible at the Imageomics Institute GitHub site: https://github.com/Imageomics/INTR.
more » « less
Full Text Available
Discovering Novel Biological Traits From Images Using Phylogeny-Guided Neural Networks

https://doi.org/10.1145/3580305.3599808

Elhamod, Mohannad; Khurana, Mridul; Manogaran, Harish Babu; Uyeda, Josef C.; Balk, Meghan A.; Dahdul, Wasila; Bakis, Yasin; Bart, Henry L.; Mabee, Paula M.; Lapp, Hilmar; et al (August 2023, KDD 2023 Proceedings. 29TH ACM SIGKDD. Conference on Knowledge Discovery and Data Mining.)

Full Text Available

Search for: All records